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Abstract 

<D . 

We determine the exact freezing threshold, r* , for a family of models of random boolean 
constraint satisfaction problems, including NAE-SAT and hypergraph 2-colouring, when the 
constraint size is sufficiently large. If the constraint-density of a random CSP, F, in our family 
is greater than r* then for almost every solution of F , a linear number of variables are frozen, 
meaning that their colours cannot be changed by a sequence of alterations in which we change 
o(n) variables at a time, always switching to another solution. If the constraint-density is less 
than r* , then almost every solution has o(n) frozen variables. 

Freezing is a key part of the clustering phenomenon that is hypothesized by non-rigorous 
techniques from statistical physics. The understanding of clustering has led to the development 
of advanced heuristics such as Survey Propogation. It has been suggested that the freezing 
threshold is a precise algorithmic barrier: that for densities below r' the random CSPs can be 
solved using very simple algorithms, while for densities above one requires more sophisticated 
techniques in order to deal with frozen clusters. 







1 Introduction 



The clustering phemonenon is arguably the most important development in the study of random 
constraint satisfaction problems (CSP's) over the past decade or so. Statistical physicists have 
discovered that for typical models of random constraint satisfaction problems, the structure of the 
solution space appears to undergo remarkable changes as the constraint density increases. 

A common geometric interpretation of the clustering analysis paints the following picture. Most 
of it is not proven rigorously; in fact many details are not specified precisely. Nevertheless, there 
is evidence that something close to this takes place for many natural random CSP's: At first, all 
solutions are very similar in that we can change any one solution into any other solution via a 
sequence of small local changes; i.e. by changing only o(n) variables-at-a-time, always having a 
satisfying solution. This remains true for almost all solutions until the clustering threshold |42[I43]. 
at which point they shatter into an exponential number of clusters. Roughly speaking: one can 
move from any solution to any other in the same cluster making small local changes, but moving 
from one cluster to another requires changing a linear number of variables in at least one step. As 
we increase the density further, we reach the freezing threshold [51]. Above that point, almost all 
cluster^] contain frozen variables; that is, variables whose values do not change for any solutions in 
the cluster. At higher densities, we find other thresholds, such as the condensation threshold [36] 
above which the largest cluster contains a positive proportion of the solutions. Eventually we reach 
the satisfiability threshold, the point at which there are no solutions. 

The methods that are used to describe these phenomena and determine the values of the thresh- 
olds are mathematically sophisticated, but are typically not rigorous. Nevertheless, they have 
transformed the rigorous study of random CSP's. 

For one thing, this picture explained things that mathematicians had already discovered. For 
some problems (eg. /c-NAE-SAT [TO], fc-SAT [12] and fc-COL [11]) the second moment method 
had been used to prove the existence of solutions at densities that are close to, but not quite, the 
hypothesized satisfiability threshold. We now understand that this is because the way that the 
second moment method was applied cannot work past the condensation threshold. As another 
example, it had long been observed that at a point where the density is still far below the sat- 
isfiability threshold, no algorithms are proven to find solutions for many of the standard random 
CSP models. We now understand [43] that this observed "algorithmic barrier" is asymptotically 
equal to the clustering threshold as k grows ( [3] provides rigorous grounding for this), and so the 
difficulties appear to arise from the onset of clusters. It has been suggested that this algorithmic 
barrier may occur precisely at the freezing threshold; i.e. the formation of clusters does not cause 
substantial algorithmic difficulties until most of the clusters have frozen variables (see section 11.11 
below) . 

Although the picture described above is, for the most part, not established rigorously, under- 
standing it has led to substantial new theorems [1,5,6,20-23,28,29,33,39,48,52]. For example, [23J 
used our understanding of how condensation has foiled previous second moment arguments to 
modify those arguments and obtain a remarkably tight bound on the satisfiability threshold for 
/c-NAE-SAT. [21] used our understanding of clustering to design an algorithm that provably solves 
random /c-SAT up to densities of 0(2 k In k/k), which is the asymptotic value of the clustering 
threshold. A particularly impressive heuristic result is the Survey Propogation algorithm [161143]. 
which experimentally has solved random 3-SAT on 10 7 variables at densities far closer to the satisfi- 

x By this we mean: all but a vanishing proportion of the clusters, when weighted by their size. 
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ability threshold than anyone had previously been able to handle, even on fewer than 1000 variables. 
This algorithm was designed specifically to take advantage of the clustering picture. 

Of course, another thrust has been to try to rigorously establish pieces of the clustering picture 

PUSElEQlHHimilS!!. We haV6 been m ° St successful with ^-XOR-SAT; i.e. a random system of 
boolean linear equations. The satisfiability threshold was established in |27| for k = 3 and in |26| 
for k > 4. More recently, [8,30j each established a very precise description of the clustering picture. 
It should be noted that the solutions of a system of linear equations are very well-understood, 
and that was of tremendous help in the study of the clustering of the solutions. Other CSP's, for 
which we do not have nearly as much control over the solutions, have been much more resistant to 
rigorous analysis; nevertheless, there have been substantial results - see Section [1.21 

The contribution of this paper is to rigorously determine the precise freezing threshold for a 
family of CSP models including /c-NAE-SAT and hypergraph 2-colouring. The freezing threshold 
for fc-COL was determined by the first author in [46J; prior to this work, fc-COL and &-XOR-SAT 
are the only two common models for which the freezing threshold was determined rigorously. 

We follow the approach of [46], but we differ mainly in: (i) Where [46] analyzed the Kempe- 
core, we need to analyze the *-core, which was introduced in [13] to prove the existence of frozen 
variables in random A;- SAT. (ii) Rather than carrying out the analysis for a single model, we carry 
it out simultaneously for a family of models. 

Our informal description of freezing described it in terms of the clusters. At this point, not 
enough information about clustering has been established rigorously to permit us to define freezing 
in those terms. (Eg. we do not know the exact clustering threshold for any interesting model except 
A>XOR-SAT.) So our formal definition of a frozen variable avoids the notion of clustering. 

Definition 1.1. An £-path of solutions of a CSP F is a sequence <to, <7i, at of solutions, where 
for each < i < t — 1, <7j and o~i+\ differ on at most i variables. 

Definition 1.2. Given a solution a of a CSP F, we say that a variable x is ^-frozen with respect 
to a if for every l-path a = o~o,o~i, ...,o~t of solutions of F, we have o~t{x) = o~(x). 

In other words, it is not possible to change the value of v by changing at most t vertices at a 
time. Roughly speaking, the solutions in the same cluster as a are the solutions that can be reached 
by a o(n)-path. So x is o(n)-frozen with respect to a if x has the same value in every solution 
in the same cluster as a. Thus, this definition is essentially equivalent to the informal one if the 
clustering picture is accurate. 

We make critical use of the planted model (section [3]) ; [3] permits us to do so. We prove that 
one can use the planted model up to a certain density, and so we want the freezing threshold to be 
below that density. It will be if the constraint size k is sufficiently large; k > 30 will do. 

We analyze CSP-models satisfying certain properties: non-trivial, feasible, symmetric, balance- 
dominated, and 1-essential (defined in section [2]). The first four are needed to permit the planted 
model; the fifth allows us to focus on the *-core. Given such a CSP model T, we define constants 
r/(T),r p (T) and function A(T,r) below. Our main theorem is that rj(T) is the freezing threshold 
for T and that A(T, r) is the proportion of frozen vertices. We require the density to be below r p (Y) 
in order to apply the planted model. This is not just a technicality - if the density is significantly 
above r p (T), then it will be above the condensation threshold and the expressions that we provide 
will fail to yield the correct constants. 

Given a CSP-model T, C(T,n, M) is a random instance of T on n variables and with M 
constraints (see Section [2|). We say that a property holds w.h.p. (with high probability) if it holds 
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with probability tending to 1 as n —■ oo. 

Theorem 1.3. Consider any non-trivial, feasible, symmetric, balance- dominated, and 1-essential 
CSP-model T with rf(T) < r p (Y). Let a be a uniformly random solution of C(T , n, M = rn). 

(a) For any rf(T) < r < r p (T), there exists a constant < j3 < 1 for which: 

(i) w.h.p. there are X(T,r)n + o(n) variables that are fin-frozen with respect to a. 

(ii) w.h.p. there are (1 — A(T, r))n + o(n) variables that are not 1-frozen with respect to a. 

(b) For any r < ry(T), w.h.p. at most o(n) variables are 1-frozen with respect to a. 

In other words, in a typical solution: for r > rj, a linear number of variables are an- frozen, 
while for r < rf, all but at most o(n) variables are not even 1-frozen. Furthermore, for r > rj we 
specify the number of cm-frozen vertices, up to an additive o(n) term. All but at most o(n) of the 
other vertices are not even 1-frozen. 

We remark that for k-COL and fc-XOR-SAT, we have "u;(n)-frozen" rather than "1-frozen", for 
some io(n) — > oo. Part (b) probably remains true upon replacing "o(n)" with "zero". The o(n) 
terms arises from a limitation of using the planted model. 

For k > 30 we always have ^/(T) < r p (T) (see Proposition 19. 2p and so our theorem applies. 

For densities below the freezing threshold, our proof yields that, in fact, almost all variables 
can be changed via a o(n)-path of length 1: 

Theorem 1.4. Consider any non-trivial, feasible, symmetric, balance- dominated, and 1-essential 
CSP-model T with with rf(Y) < r p (T) Let a be a uniformly random solution of C(T , n, M = rn) 
with r < ry(Y). 

For any u{n) — > oo, w.h.p. for all but at most o(n) variables x, there is a solution a' such that 
(i) o~'(x) ^ s(x) and (ii) a'(x),a(x) differ on at most u(n) variables. 

As mentioned above, our theorems apply to A;-NAE-SAT and hypergraph 2-colouring, two of the 
standard benchmark models. fc-NAE-SAT is a /c-CNF boolean formula which is satisfied if every 
clause contains at least one true literal and at least one false literal. For hypergraph 2-colouring, 
we are presented with a A;-uniform hypergraph and we need to find a boolean assignment to the 
vertices so that no hyperedge contains only vertices of one sign. Thus, it is equivalent to an instance 
of fe-NAE-SAT where every literal is signed positively. See Appendix [8] for a discussion of other 
models to which our theorems apply. 

Physicists tell us that there is a second freezing threshold, above which every solution has frozen 
variables [5T||53] (as opposed to almost every solution as in Theorem ll.3p , [13] proves that this occurs 
in fc-SAT for large enough densities (albeit for a weaker notion of freezing); see Section [1.21 We do 
not see how to determine the exact value of that threshold. 

We should emphasize that the clustering picture described above is very rough. The mathe- 
matical analysis used by statistical physicists to determine the various thresholds actually studies 
properties of certain Gibbs distributions on infinite trees rather than solutions of random CSP's. 
The clustering picture is a common geometric interpretation and it is not exact. Nevertheless, there 
is very strong evidence that something close to this picture should hold. 
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1.1 The algorithmic barrier 

A great deal of the interest in random CSP's arises from the long-established observation that as 
the densities approach the satisfiability threshold, the problems appear to be extremely difficult to 
solve |181l44j. Much work has gone into trying to understand what exactly causes dense problems 
to be so algorithmically challenging (eg. ffi[T{ ^ l2T |l4"3ll4"T] ) . 

It has been suggested (eg. [25l[3ll[35l[5ll[53l[55] ) that, for typical CSP's, the freezing threshold 
forms an algorithmic barrier. For r < rf very simple algorithms (eg. greedy algorithms with minor 
backtracking steps) will w.h.p. find a satisfying solution, but for r > rj one requires much more 
sophisticated algorithms (eg. Survey Propogation). It has been proposed that the following simple 
algorithm should succeed for r <r$ : 

Suppose that Theorem 11.41 were to hold for every solution a. We build our CSP one random 
constraint at a time, letting Fi denote the CSP with i constraints. We begin with a solution <to for 
-^b (°o can be any assignment). Then we obtain cij+i from cxj as follows: If dj does not violate the 
(i + l)st constaint added, then we keep cTj+i = dj. Otherwise, we modify Oi into another solution a' 
of Fi in which the values of the variables in the (i + l)st constraint are changed so that it is satisfied; 
then we set <7j + i = a' . If Theorem 11.41 holds for cij, then we can change each of the k variables 
in that constraint by changing only o(n) other variables. Expansion properties of a random CSP 
imply that these small changes will (usually) not interfere with each other and so we can change 
each of the k variables to whatever we want. Thus we will eventually end up with a solution o~m 
to our random CSP Fm- 

However, Theorem 11.41 does not hold for every solution, only most of them. This is not just a 
limit of our proof techniques - it is believed that it does not hold for an exponentially small, but 
positive, proportion of the solutions. So proving that this algorithm works would require showing 
that we never encounter one of those solutions. 

To see, intuitively, why the onset of freezing may create algorithmic difficulties, consider near- 
solutions - assignments which violate only a small number of constraints, say o(n) of them. The 
near-solutions will also form clusters (because of high energy barriers; see [3]). Furthermore, almost 
all clusters of near-solutions will not contain any solutions. This is because, above the freezing 
threshold, almost all solution clusters have a linear number of frozen variables and so after adding 
only o(n) constraints, we will pick a constraint that violates the frozen variables. This will violate 
all solutions in that cluster, thus forming a near-solution cluster that contains no actual solutions. 
Of course, this description is non-rigorous but it provides a good intuition. 

Now consider a greedy algorithm with backtracking. As it sets its variables, it will approach 
a near-solution p. At that point, it cannot move to a near-solution in a different cluster than p, 
without employing a backtracking step that changes a linear number of variables. So the algorithm 
will need to be sophisticated enough to approach one of the rare near-solution clusters that contains 
solutions. 

As described above, there is a second freezing threshold, above which every cluster has frozen 
variables. [55] suggests that this is another algorithmic barrier above which even the sophisticated 
algorithms fail to find solutions. One indication is that, empirically, every solution a found by 
Survey Propogation is such that no variables are frozen with respect to a. So somehow, the 
algorithm is drawn to those rare unfrozen clusters, and hence may fail when there are no such 
clusters. 
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1.2 Related work 



The clustering picture for /c-NAE-SAT and hypergraph 2-colouring was analyzed non-rigorously 
in [25]. There are hundreds of other papers from the statistical physics community analyzing 
clustering and related matters. Some are listed above; rather than listing more, we refer the reader 
to the book [4"T] . 

Achlioptas and Ricci-Tersenghi |13| were the first to rigorously prove that freezing occurs in a 
random CSP. They studied random fc-SAT and showed that for k > 8, for a wide range of edge- 
densities below the satisfiability threshold and for every satisfying assignment a, the vast majority 
of variables are 1-frozen w.r.t a. They did so by stripping down to the *-core, which inspired us 
to do the same here. One difference between their approach and ours is that the variables of the 
*-core are 1-frozen by definition, whereas much of the work in this paper is devoted to proving 
that, for our models, they are in fact 0(n)-frozen. We expect that our techniques should be able 
to prove that the 1-frozen variables established in |13| are, indeed, 0(n)-frozen. 

[3] proves the asymptotic (in k) density for the appearance of what they call rigid variables 
in /c-COL, /c-NAE-SAT and hypergraph 2-colouring (and proves that this is an upper bound for 
A;-SAT). The definition of rigid is somewhat weaker than frozen, but a simple modification extends 
their proof to show the same for frozen vertices. So [5] provided the asymptotic, in k, location of 
the freezing threhold for those models. |46j provided the exact location of the threshold for /c-COL, 
when k is sufficiently large. 

[3"P |48| establish the existence of what they call cluster-regions for fc-SAT, /c-COL, A;-NAE-SAT 
and hypergraph 2-colouring. [3] proves that by the time the density exceeds (1 + 0^(1)) times the 
hypothesized clustering threshold the solution space w.h.p. shatters into an exponential number of 
G(n)-separated cluster-regions, each containing an exponential number of solutions. While these 
cluster-regions are not shown to be well-connected, the well-connected property does not seem to 
be crucial to the difficulties that clusters pose for algorithms. So this was a very big step towards 
explaining why an algorithmic barrier seems to arise asymptotically (in k) close to the clustering 
threshold. 

[9lll0| provided the first asymptotically tight lower bounds on the satisfiability threshold of k- 
NAE-SAT and hypergraph 2-colouring, achieving a bound that is roughly equal to the condensation 
threshold. [24] provides an even stronger bound for hypergraph 2-colouring, extending above the 
condensation threshold. [23j provides a remarkably strong bound for A;-NAE-SAT - the difference 
between their upper and lower bounds decreases exponentially with k. 

2 CSP models 

A boolean constraint of arity k consists of k ordered variables (x\, . . . ,Xk) together with a boolean 
function tp : {— 1, l} fc — > {0, 1}. This function constrains the set of variables to take values a = 
(oi, . . . ,o"fc) € {—1, l} fc such that </?(<ti, . . . , Ofc) = 1. We say that the constraint is satisfied by a 
boolean assignment a if it evaluates to 1 on a. 

A constraint satisfaction problem (CSP) is a set of constraints, where the a th constraint is 
formed by a boolean function ip a over the variables (xi 1 a , . . . , Xi k J, with i j>a G [n]. A CSP, H, 
defines a boolean function F^ H ' : {—1,1}" — > {0, 1} given by 

(oi, . . . , a n ) := Jl (p a (cx il>a a ika ) . 

a 
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Given a £ {— 1, l} n , we say that a is a satisfying assignment, or solution, of the CSP H if a satisfies 
every constraint of H; i.e. if F^ H \a) = 1. 

A CSP model is a set $ of boolean functions, together with a probability distribution p : $ — > 
[0, 1] defined on it (we assume implicitly that the support of p is 3>). Our random CSPs are: 

Definition 2.1. Given a CSP model T = a random CSP ; C(T,n, M), is a CSP over the 

variables {x\, . . . ,x n } consisting of M constraints {ip a (xi 1 a , . . . , Xi k a ) : a = 1,...,M} where the 
boolean constraints {<p a : a = 1, . . . ,M} are drawn independently from <I> according to the distri- 
bution p, and the k-tuples {(x^ a , ...,Xi k a ) : a = 1, . . . , m} are drawn uniformly and independently 
from the set of k-tuples of {x\, . . . , x n }. 

We consider random CSP-models T = (3>,p) with the following properties. 

Definition 2.2. 

Non-trivial: There is at least one <p £ $ that is not satisfied by x\ = ... = X\~ = 1 and at least one 
ip £ that is not satisfied by x\ = ... = Xk = — 1- 

Feasible: For any p £ $, and every assignment to any k — 1 of the variables, at least one of 
the two possible assignments to the remaining variable will result in <p being satisfied. 

Symmetric: For every ip £ $, and for every assignment x, we have (p(x) = ip{—x), where —x is 
the assignment obtained from x by reversing the assignment to each variable. 

Balance- dominated Consider a random assignment a where each variable is independently set to 
be 1 with probability q and -1 with probability 1 — q, and let ip be a random constraint from $ with 
distribution p. The probability that a satisfies ip is maximized at q = \. 

Those four properties will allow us to apply the planted model. 'Non-trivial' is a standard 
property to require. 'Feasible' is also quite natural, although some common models do not satisfy 
it. The other two properties help us to bound the second moment of the number of solutions, which 
in turn enables us to use the planted model. 

Our final property allows us to analyze frozen variables using the *-core. 

Definition 2.3. 1-essential: Given a boolean constraint ip and an assignment a that satisfies 
cp, we say that the variable x is essential for {ip, a) if changing the value of x results in ip being 
unsatisfied. We say that a set $ of constraints is 1-essential if for every and every a 

satisfying ip, at most one variable is essential for (ip,o~). A CSP-model is 1-essential if is 

1-essential. A CSP is 1-essential if all of its constraints are 1-essential. 

For example: in hypergraph 2-colouring, x is essential iff its value is different from that of every 
other variable in 4>; in /c-XOR-SAT, every variable is essential. It is easily confirmed that for k > 3: 
fc-SAT, hypergraph 2-colouring and fc-NAE-SAT are 1-essential, but /c-XOR-SAT is not. 

3 The planted model 

Consider any CSP-model T = (&,p). Theorem 11.31 concerns a uniformly random satisfying assign- 
ment of C(T ,n, M); i.e. a pair (F,a) drawn from: 
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Definition 3.1. The uniform model U(T, n, M) is a random pair (F, a) where F is taken from the 
C(T,n, M) model and a is a uniformly random satisfying solution of F. 

The uniform model is very difficult to analyze directly. So instead we turn to the much more 
amenable planted model: 

Definition 3.2. The planted model P(T,n, M) is a random pair (F,a) chosen as follows: Take 
a uniformly random assignment a £ {— l,+l} n . Next select a random F drawn from C(T,n, M) 
conditional on a satisfying F. 

Remark: Note that we can select F by choosing M independent constraints. Each time, we 
choose a uniformly random fc-tuple of k variables, then choose for those variables a constraint 
ip 6 <J? with probability distribution p. If a does not satisfy the constraint then reject and choose 
a new one. Equivalently, we can choose the /c-tuples non-uniformly where the probability that a 
particular /c-tuple is chosen is proportional to the probability that, upon choosing ip for that set, 
the constraint will be satisfied by a. Then we choose <p € <J> with probability p conditional on a 
satisfying ip. 

It is not hard to see that the uniform and planted models are not equivalent. In the planted 
model, a CSP is selected with probability roughly proportional to the number of satisfying assign- 
ments. Nevertheless, Achlioptas and Coja-Oghlan [3] proved that, under certain conditions, one 
can transfer results about the planted model to the uniform model when T is &-COL, fc-NAE-SAT 
or hypergraph 2-colouring (also A;- SAT, but under stronger conditions). Montanari, Restrepo and 
Tetali |48| extended this to all T in a class of CSP-models, including all models that are non-trivial, 
feasible, symmetric, and balance-dominated. 

For each non-trivial, feasible, symmetric and balance-dominated CSP-model T we define (in 
Appendix [8]) a constant r p (T), which is the highest density for which we can use the planted 
model. The following key tool essentially follows from Theorem B.3 of |48| . except that they do 
not explicitly mention r p (T), instead giving an implicit lower bound under appropriate conditions. 
It was first proven in [3] for NAE-SAT, hypergraph 2-COL and a few other models. 

Lemma 3.3. Consider any non-trivial, feasible, symmetric, and balance- dominated CSP-model T. 
For every r < r p (T), there is a function g(n) = o(n) such that: Let £ be any property of pairs 
(F, a) where a is a satisfying solution of F. If 

Pr(P(T, n,M = rn) has £) > 1 - e~ 9(n) , 

then 

Pr(f/(T, n, M = rn) has £) > 1 - o(l). 

In Appendix [8l we prove that if T is also 1-essential, then for k > 30, we have r p (T) > r/(T) 
and so Theorem ll.3l is non- trivial. In fact, r p (T) = S(i^)r/(T). The bound k > 30 can be lowered, 
and for some specific models T it can be lowered significantly. For example, for /c-NAE-SAT and 
hypergraph 2-colouring, one can probably prove that k > 6 will do. 

4 The *-core 

The *-core was introduced in |13j to study frozen variables in random fc-SAT. 
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Fix a satisfying assignment a, and consider a variable x. Suppose that there are no constraints 
(p such that x is essential for (ip, a). Then, by the definition of essential, we can change x and still 
have a satisfying assignment. So x is not frozen. This inspires the following: 

Definition 4.1. Consider a CSP F with a satisfying assignment a. The *-core of (F,a) is the 
sub- CSP formed as follows: 

Iteratively remove every variable x such that for every constraint ip: x is not essential for (ip,a). 
When we remove a variable, we also remove all constraints containing that variable. 

Note that the order in which variables are deleted will not affect the outcome of the iterative 
procedure. So the *-core is well-defined, albeit possibly empty. 

As described above, it is clear that the first variable removed is not frozen. Expansion properties 
of a random CSP - in particular the fact that it is locally tree-like - imply that almost every variable 
removed is not frozen. Furthermore, we will prove that if the model is 1-essential then almost all 
variables that remain in the *-core are frozen. Having proven those two key results, Theorem 11.31 
follows from an analysis of the *-core process. 

Now suppose that our CSP-model is 1-essential. A key observation is that the *-core depends 
only on the constraints that have essential variables. I.e., if we first remove all constraints with 
no essential variables from the CSP and then apply the *-core process, the set of variables in the 
resultant *-core will not change. 

Definition 4.2. Given a 1-essential CSP, F, and a satisfying solution a, we define the hypergraph 
r(F, a) as follows: The vertices are the variables of F and the variables of each constraint of F 
form a hyperedge, if that constraint has an essential variable. That essential variable is called the 
essential vertex of the hyperedge. 

Note that we can find the *-core of (F, a) by repeatedly deleting from T{F, a) vertices that are 
not essential in any hyperedges, along with all hyperedges containing the deleted vertices. The 
resulting hypergraph is called the *-core ofY(F,a). 

The precise model for the random hypergraph T(F, a) varies with T (see appendixllOp. However, 
the size of the *-core as a function of the number of hyperedges is the same for all such models. 

We define: 

x 

ak := in £7i -x\k-i - 

x>o (1 — e X ) K 1 

Also, for a > ak, let Xk(a) be the maximum value of x > such that ^ 1 _ e _ :E )fc-i = a an d set 

p k (a) = l-e- x ^ a \ 

In Appendix 1111 we prove 

Lemma 4.3. Consider any 1-essential CSP-model T = (&,p) of arity k, and a random CSP, F, 
drawn from P(T,n,M = rn). Suppose T(F,o~) has an + o(n) hyperedges. For any g(n) = o(n), 
with probability at least 1 — e~ 9 ^ : 

(a) If a > ak then the *-core ofT(F,o~) has pk{a)n + o(n) vertices. 

(b) If a < ak then the *-core ofT(F,o~) has o(n) vertices. 
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This allows us to analyze our family of models simultaneously by working directly with the 
*-core of r(i ? , a). We prove that almost all vertices of the *-core are 0(n)-frozen variables in F 
and almost all vertices outside of the *-core are not even 1-frozen in F. 

In Appendix [9l we define for any 1-essential CSP-model T = ($>,p), a constant £(T) > and 
prove: 

Lemma 4.4. For any g(n) = o(n) and r > 0, with probability at least 1 — e~ 9 ^ n \ the number of 
constraints in P(T,n,M = rn) that have an essential variable is £(T)rn + o(n). 

This yields Theorem 11.31 (see appendix fT0|) with: 

r/(T) = a fc /£(T); A(T, r) = p fc (£(T)r). 

In Appendix[10l we describe the models that we use to analyze T(F, a) and the *-core of T(F, a). 

5 Unfrozen variables outside of the *-core 

Let x be a vertex of T(F,a) which is not in the *-core of T(F,a). We will consider how x can be 
removed during the peeling process used to find the *-core of T(F, a). More specifically, we consider 
a sequence of vertices, culminating in x, which could be removed in sequence by the peeling process. 

Definition 5.1. A peeling chain for a vertex x G T(F,a) is a sequence of vertices x\,...,xe = x 
such that each Xi is not essential for any hyperedges in the hypergraph remaining after removing 
x%, ...,Xi-\ from T(F, a). The depth of the chain is the maximum distance from one of the vertices 
to x. The *-depth of x is the minimum depth over all peeling chains for x. 

In Appendix II 11 we will prove: 

Lemma 5.2. Consider any non-trivial, feasible, symmetric, balance- dominated, and 1-essential 
CSP-model T. Let (F,a) be drawn from the planted model P(T,n,M = rn) where r ^ rj(T). 

For any e > 0, there exists constant L such that: For all g{n) = o{n), the probability that at 
least en vertices ofT(F,o~) that are not in the *-core ofT(F,o~) have *-depth greater than L is less 
than e" 9 K 

This is enough to prove that all but o(n) variables outside the *-core are 1-frozen as follows: 
Proof outline of Theorem \1.3\f a. ii. b): 

Consider any e > 0. If (F,a) is drawn from the planted model then, by Lemma 15.21 T(F, a) 
has fewer than en vertices of *-depth greater than L with probability at least 1 — e~ 9 ^ n \ So for 
r < r p (T), Lemma 13.31 yields that the same is true w.h.p. when (F,a) is drawn from the uniform 
model. 

Consider any vertex x of *-depth at most L. Consider a peeling chain for x of depth at most L 
and let W be the set of all hyperedges that contain at least one vertex of the peeling chain. 

If no hyperedges of W form a cycle, then it is easy to see that we can change all of the variables 
in the peeling chain, one-at-a-time and still have a satisfying assignment for F. Indeed, this follows 
from a straightforward induction on L. Therefore, the variable x is not 1-frozen. The case where 
W contains a cycle is rare enough to be negligible (see Appendix 112. 2p . So for all e > there are 
fewer than en variables outside of the *-core that are not 1-frozen, as required. □ 

This argument also leads to: 
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Proof outline of Theorem \1.3l This theorem follows as above, by adding the observation that 
with sufficiently high probability, almost all vertices outside the *-core have a peeling chain of size 
0(1). We can change the corresponding variable by changing a subset of the entire peeling chain. 
See Appendix [10] for the short proof. □ 



6 Frozen variables in the *-core 

Most of the work in this paper is in proving that almost all vertices in the *-core of T(F,o~) are 
B(n)-frozen. To do so, we first study the structure of sets of variables that can be changed to 
obtain a new solution. Note that if changing the value of every variable of S yields a solution, then 
every constraint whose essential variable is in S must contain at least one other variable in S. This 
leads us to define: 

Definition 6.1. A flippable set of the *-core ofT(F,a) is a set of vertices S such that for every 
x € S and every *-core hyperedge f in which x is essential, S contains another vertex of f . 

For every vertex x € S, since x is in the *-core, there will be at least one such hyperedge /. 

Note: if S is a flippable set in T(F, a), then changing the variables of F corresponding to S will 
not necessarily yield another solution; this will depend on the actual constraints of F. But it is 
easily seen that the converse holds: 

Proposition 6.2. Ifcr,a' are two solutions to a 1-essential CSP, F, then the set of *-core variables 
on which they differ form a flippable set in T(F,a). 

Proof. Let S be the set of variables in the *-core of (F, a) on which a, a' differ. Suppose that S 
does not form a flippable set in T(F, a). Then there is a variable x 6 S and a *-core hyperedge e in 
which x is essential, such that e contains no other members of S. The hyperedge e corresponds to a 
constraint in F. In that constrain, the solutions a, a' agree on all variables but x, which contradicts 
the fact that x is essential for e. □ 

We prove that for some <j>'{n) = o(n) and constant Q > 0, with sufficiently high probabilty, there 
are no flippable sets of size 4>'{n) < a < £n. This will be enough to prove that at most o(n) vertices 
lie in flippable sets, which in turn will be enough to show that almost all of the *-core is frozen. 

We apply the first moment method. Unfortunately, we cannot apply it directly to the number 
of flippable sets because the existence of one flippable set S typically leads to the existence of an 
exponential number of flippable sets formed by adding to S vertices x such that (i) x is essential 
in exactly one hyperedge, and (ii) that hyperedge contains a non-essential vertex in S. So instead 
we focus on something that we call weakly flippable sets, which do not contain such vertices x. 
Roughly speaking: every flippable set can be formed from a weakly flippable set by repeatedly 
adding vertices x in that manner. We prove that with sufficently high probability: 

(a) There are no weakly flippable sets of size cp(n) < a < Qn. 

(b) There are no weakly flippable sets of size at most 4>(n) which extend to a flippable set of size 
greater than (f>'(n). 
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This establishes our bound on the sizes of flippable sets. (This is not quite true - we also need to 
consider cyclic sets - but it provides a good intuition.) 

Let Hi denote the vertices that are essential in exactly one hyperedge. Define a one-path to be 
a sequence of vertices x\, ...,xt+i such that for each 1 < i < t: Xi G H\ and Xj+i is in the hyperedge 
in which Xi is essential. Note that if xt+i is in a flippable set S, then we can add the entire one- 
path to S and it will still be flippable. This ends up implying that if we have a proliferation of 
long one-paths, then we would not be able to prove (b). It turns out that a proliferation of long 
one-paths would also prevent us from proving (a). 

Consider a vertex x € Hi and the edge / in which x is essential. Intutively, the expected 
number of other members of Hi that are in / is (k — l)\Hi\ divided by the size of the *-core. We 
prove (Lemma 110.31) that this ratio is less than 1. This implies that one-paths do not "branch" 
and so we do not tend to get many long one-paths. So our bound on this ratio plays a key role in 
establishing both (a) and (b). 

This is just an intuition. In fact, one-paths are not explicitly mentioned anywhere in the proofs. 
For all the details, see Appendix [T2l 

7 Further Challenges 

Of course, one ongoing challenge is to continue to rigorously establish parts of the clustering picture. 
By now, it is clear that in order to establish satisfiability thresholds or understand the algorithmic 
challenges for problems with densities approaching that threshold, we will probably need a strong 
understanding of clustering. 

Another challenge is to try to establish whether the freezing threshold is, indeed, an algorithmic 
barrier. For several CSP-models, we now know the precise location of that threshold, and we have 
a very good understanding of how it arises and which variables are frozen. Perhaps we can use that 
understanding to prove that a simple algorithm works for all densities up to that threshold and/or 
establish that frozen clusters will indeed neccesitate more sophistication. 

Another challenge is to determine the freezing threshold for a wider variety of CSP-models. 
These techniques rely crucially on using the planted model; at this point there is no known way 
to get to the exact threshold without it. This prevents us from extending our results to A:-SAT 
and many other models as the planted model does not work nearly well enough, mainly because 
the number of solutions is not sufficiently concentrated. A more important challenge would be to 
devise a better means to analyze random solutions to CSP's drawn from those models. 
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Appendix 



8 The transfer theorem 

Let us consider a CSP-model T = Let us recall the properties from Definition 12.21 Given 

a boolean function p £ <5, we denote by S v the set of satisfying assignments of (p and also we 

define I v := {-l,l} fc \ S v . Now, let <p[x) = E (<PqU i£Q x i) be its Fourier expansion. 



Such expansion is unique with pq := ^ \ ( P( x )Y[ieQ Xi ) ■ m particular, it is the case that 
^0 = ^fj~ = E <Pq- Moreover, if p is symmetric, we have that p^y = (In fact, ipq = 

QC{-1,1}* 

whenever \Q\ is odd). Now, we define the polynomial as follows, 

QC{-1,1}*= 

Also, we define the binary entropy function H(0) as 

H(f) := In(l + 0) - ^ ln(l - 0) 

Finally, we define 

r p (T) := mf 



^6(0,1) £<pe**V m (^(^))' 

We will now prove Lemma 13.31 which we restate: 
Lemma 13.31 Consider any non-trivial, feasible, symmetric, and balance- dominated CSP-model T. 
For every r < r p (T), there is a function g{n) = o(n) such that: Let 6 be any property of pairs 
(F,a) where a is a satisfying solution of F. If 

Pr(P(T, n,M = rn) has £) > 1 - e^ (n) , 

then 

Pr(J7(T, n,M = rn) has £) > 1 - o(l). 



The proof follows the argument employed in |48] to prove Theorem B.3, which followed the 
same spirit of similar results in |3j. 

Proof. In what follows, we will take expectations over a random p> chosen from $ with distribution 
p. Thus, for a variable X(ip), we have Exp(X) = E^e^M^)^^)- Let £^ be the number of clauses 
with constraint p in the random CSP H drawn from T. Let 7 be a fixed constant in (0, 1/2) and 
let T be the event 'For all p G <I>, — ap v n\ < n 1//2+7 '. So, T holds w.h.p. 

We say that a solution a is balanced if the number of variables assigned +1 is either [^] or [§J • 
Let Zi, be the number of balanced solutions of H , let Z be the number of solutions of H and let 
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Z b {9) be the number of pairs of balanced solutions x^-\x^ 2 ' with discrepancy 9, that is, such that 
h W = 0. Now, 

Exp[Z b 2 I(J-)] _ ^ Exp [Z 6 (0)1(7-)] 



(Exp[Z fe I(J-)]) 2 ^ (Exp[Z b I(^)]) 2 
where f/ n := {i/n : i = — n, . . . , n}. From lemma A. 2 in [35], then it is the case that 

fsxpfzli^)]) 2 - Cn ~ 1/2 exp (n(H(e) + « Ex P[ ln M^))] + 

where C does not depends on 9 (neither the o(l) term). 
Now, if a < r p (T), it is the case that 

H(0) + aExppn(p v (0))] < for all 9 e (0, 1). (1) 

On the other hand, since T is symmetric, 



//(«)|nExp|]nl /V (W))] - ( ~ + aExp 



[01=2 



+ O(0 4 ). 



Now, since 



1/2 

lim — , = ? - > a, 



9^o Exppn(p v (0))] Exp \^ lQl=2 ^ Q / n )2 



then it is the case H(9) + aExp[ln(p ¥ ,(^))] < — c9 2 for some c > and close enough to 0. 
Combining this fact with eq. (pQ), we have that for some d > 0, 

H(9) + aExp[ln(pp(0))] < -c'9 2 for all G (0, 1). (2) 

Now, 

Exp[Z 2 imi C . , , - 

(Exp[Z 6 I(J r )]) i n 1 ' 2 f-£ n 

POO 

< Cn 1 ' 2 exp(-c'n(# 2 + o(l))) (4) 



And the last quantity is bounded by a constant Co (not depending on n). This implies, by the Paley- 
Zygmund inequality, that for every e > and all n > no it is the case that Pr(Z;, > e _ne Exp[Z;,]) > 
Co/2. 

Now, because T is balance-dominated, we have that Exp[Z] < nExp[Z&]. Therefore, for n 
large enough, we have that 

Pr(Z > e- ne Exp[Z]) > Pr(Z b > ne- ne Exp[Z b ]) > Pv(Z b > e - n(e/2) Exp[Z 6 ]) > C /2. 

On the other hand, it is easy to see that Exp[Z] is exponential in n for a < r p (T) (Indeed Exp[Z] 

is exponential for a < r sat (T) := !° 2 , T , nry ... = *p ft^t^ttttt)- Now, let us recall from 

v ' Exp JM\+\i v \l\s v \)\ Exp[in(p ¥ ,(i))] > 
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Appendix C in [JB], that the event 'Z > B n \ where B > 1, has a sharp threshold in the clauses 
to variables ratio. Thus, the event l Z > e _ne Exp[Z]' has a sharp threshold in the parameter a. 
Therefore, necessarily, it is the case that Z > e~ ne Exp[Z] w.h.p.. This implies therefore, that for 
some function g(n) of order o(n), it is the case that w.h.p., 

ln{Z) > ln(Exp(Z)) - g{n). (5) 

After this equation is established now the lemma follows. For instance, from Theorem B.3 in 

□ 



Now, recall our other property: 

1-essential: Given a boolean constraint ip and an assignment a that satisfies ip, we say that 
the variable x is essential for (ip, a) if changing the value of x results in ip being unsatisfied. We 
say that a set $ of constraints is 1-essential if for every ip € and every a satisfying ip, at most 
one variable is essential for (ip,a). A CSP-model (&,p) is 1-essential if & is 1-essential. 

An easy description of a feasible, 1-essential constraint is the following: ip is feasible and 1- 
essential iff the Hamming distance between any pair of assignments in I v is greater than 2. This 
implies in particular that \I V \ < ,3 and *P{ij} = — or ^Cxe/ x % x y This allows us to prove a 

more concrete lower bound on the transfer threshold r p (T) that we will use in the next section to 
establish that r p (Y) is above the freezing threshold for large enough k. 

Theorem 8.1. Consider any non-trivial, feasible, symmetric, balance- dominated and 1-essential 
CSP-model T. It is the case that 

rp[ > ~ fip(T)' 

where 

U P (T) :=Exp^|/|^|]. 
Proof. Since every constraint ip € <3? is feasible and 1-essential, we have that 



2 



\2 

T,xei v x i x i j „ fk\ ( 11^ I \ 2 



, . ., >, <P% J fri, |SVI 2 " V 2 / \\ S f\ 



Therefore, since 




we have that 



tp\J \S<p\ 



And, since \I V \ < tm— [ 1 an d therefore (2) fj^jj — |5|> we get that 



Pm < 1 + 

1 ^<p 1 
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Thus, 

Exp^ln^))] < 26 2 n p (T) 

Now, we finally conclude that 

r (T) = inf > inf = (6) 

P j 96(0,1) Exp^ln^))] " flp(T) 96(0,1) # 2 " n p (T)' 1 J 

□ 

We close this section by discussing the CSP-models that satisfy our five conditions: non-trivial, 
feasible, symmetric, balance-dominated, and 1-essential. 

Our properties are rich enough to permit a large class of CSP-models beyond hypergraph 2- 
coloring and fc-NAE-SAT. For example, we can construct a model in the following way: 

Represent the assignments in {— 1, + l} k as the /c-dimensional hypercube H^, and so two assign- 
ments are adjacent if they differ on exactly one variable. Let L e denote the vertices x G with 
^2 Xk > ek. Consider any subset I C L e containing no two vertices of distance at most two. We 
use —I to denote the subset formed by switching the sign of every vertex in /, and set J := ID —I 
to be the assignments which violate our constraint p>j. I.e., <pj(x) := 1 iff x £ J. 

Now consider any set <E> of constraints of this form in which at least one is non-trivial (i.e. has 
(1, 1, 1) G J). Let T = (3>,p) for any p (such that supp(p) = For any k large enough in 
terms of e, $ satisfies our five properties. For instance, hypergraph 2-coloring is formed in this way 
with I := (l,...l). 

Given a constraint p and some s G { — 1, +l} fc , we define the constraint ip s as <p s (xi, ...,£&) = 
ip(s±xi, SfcXfc). We can allow e = and drop the condition that k must be large if (a) no two 
vertices of J are within distance 2, and (b) for every ip G $ and every s G {— l,+l} fc , we have 
(p s G and p{tp s ) = p(f)- For instance, A;-NAE-SAT is formed in this way with / := (1, 1). 



9 Essential hyper edges 

Consider any nontrivial, feasible, symmetric 1-essential CSP-model T = ( < l ) ,p). We will draw 
(F,a) from the planted model P(T,n,M). We begin by taking a random assignment a for the 
variables x±, ...,x n and note that |A + |, |A~| = \n + o(n) with probability at least 1 — e~ 9<yn \ for 
any g{n) = o(n). So we can assume that this condition holds. 

In what follows, we will take expectations over a random (p chosen from with distribution p. 
Thus, for a variable X{ip), we have Exp(A) = ^2^$ p(<p)X ((p) . 

For every tp G recall from the previous section that S<p is the set of assigments in {— 1, +l} fc 
that satisfy p and l v = is the set that do not satisfy p. We define S* C S v to be the set of 
assignments that satisfy p and for which p has an essential variable. Noting that switching the 
essential variable of an assignment in yields an assignment in i^, and using the fact that T is 
feasible, it is easy to see that \S^\ = k\I v \. 

Since |A + |, |A _ | = \n + o(n), it follows that when picking a constraint in the planted model, 

we choose p with probability proportional to p(p)\S (p \ + o(l). Thus, defining Qf := | g*^ | ' 
probability that p has an essential variable is: 

C(T) = fcn / + o(i). 
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So the number of constraints that have an essential variable is distributed as the binomial 
BIN(M = rn, £(T)). Concentration of the binomial variable implies Lemma 14.41 

Now recall the type of (p, as defined in Section flO-ll For a constraint 9? G $, define I v {a,b) := 
{x £ I v : x has a l's and b — l's} then the clause (p has exactly (b + l^i^a, 6 + 1)| assignments 
of type (1; a, b) and (a + l)\I v [a + 1, 6)| assignments of type (—1; a, b). Therefore, when picking a 
constraint in the planted model, if we condition on the event that it has an essential variable, then 
the conditional probability that it has type r = (1; a, b) is 

_ (6 + l)Exp[|J y (a,b + l)|] 



and to be of type r = (—1; a, b) is 



(a + l)Exp[|/ ¥ ,(a + 1,6)|] 



7t = + 

Since T is symmetric, <p(x) = ip(—x) for every assignment x. It follows that ^(a, 6)| = \I,p(b,a)\ 
and therefore 7 r= (i ;aj 6) = 7 r =(-i;&,a) + So, noting that we can exchange a, 6 in the following 

definition: 

7 + := 7t ' 7 ~ := 7t ' 

r=(l,a,6) r=(-l,a,&) 
we have 7 + = 7"" = \ + o(l). In other words: 

Lemma 9.1. When we choose a random clause for the planted model, and condition on it having 
an essential variable: the probability that the essential variable is in A + is equal to the probability 
that it is in A~ plus o(l). 

We close this section by showing that r/(T) < r p (T) for sufficiently large k. 

Proposition 9.2. For any nontrivial, symmetric, feasible, balance-dominated, 1 essential CSP 
model T of arity k: 

(a) For every k > 27, r p (T) > r/(T). 

(b) Asymptotically in k, < 
Proof. Notice first that 



£l p = Exp 



I "-V I 



< Exp[|/ y j] < Exp[|/J Qj 



^(l-^-d-^ExpHM (1-^) 

TV,, 

(l-l/k 2 ) k - 



Notice also that < q J^/fc^-i • Therefore, since 



21n ( fc ) < ( i/4)(l - -J— 

fc (i-i/fc2)fc-i -ww ( ^ ) + r 



^2 

for > 27, then 



21n(fc) (1/4) 
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by Theorem 18.11 Then, part (a) follows. To prove part (b) we use the previous inequality, so that 



r f( T ) 81n(ife) , „ WI 

< 1 — ~ 8ln( k) k 

r p (T) ~ fc(i_^_)(i_i/fca)fc-i 1 " 



□ 



10 The *-core 

Lemma 10.1. Consider any 1-essential CSP-model T = of arity k, and a random CSP, F , 

drawn from P(T,n,M = rn). Suppose T(F,a) has an + o(n) hyperedges with a ^ a.}.. For any 
</(n) = o(n) and constant e > 0, there exist constants T, Z, (3 > such that, with probability at least 
1 - e~ 9 ^ : 

(a) All but o(n) vertices of the *-core ofT(F,a) are fin-frozen variables for (F,o~). 

(b) All but at most en vertices outside the *-core ofT(F,a) are either (i) not T -frozen variables 
for (F,o~) or (ii) within distance Z from a cycle of length at most Z. 

This yields Theorem 11.31 

Proof of Theorem \1.3l Consider (F, a) drawn from the uniform model U(T, n,M = rn). A sim- 
ple first moment calculation shows that the expected number of variables that are within distance 
Z of a cycle of length at most Z in the underlying hypergraph of F is 0(1). Therefore w.h.p. there 
are o(n) such vertices. 

For part (b): If r > r/(T) then a > at- Consider any e > 0. Lemma 13.31 allows us to transfer 
Lemmas 14. 3| 110. H 14.41 to (F, a) to establish that w.h.p. all but at most en variables are either 
T-frozen with respect to a or are within distance Z of a cycle of length at most Z. W.h.p. there 
are o(n) variables of the latter type, and so all but at most en + o(n) vertices are T-frozen. By 
letting T tend to infinity we can take e arbitrarily small thus obtaining part (b). 

For part (a): If r > r/(T) then a < a/-. Again, we transfer Lemmas 14.31 fTOTT] 14.41 to (F,o~). 
This shows that w.h.p. all but o(n) of the vertices of the *-core are frozen. The same argument as 
for part (b) shows that w.h.p. all but o(n) of the vertices outside of the *-core are frozen. Part (a) 
follows since A(T,r) = /9j.(£(T)r) = pk(ot) and w.h.p. the size of the *-core is pk{ct)n + o(n). □ 

Lemma 110. 1( a) is proven in Section [12j Lemma llO.l( b) follows from Lemma 15.21 as follows: 
Proof of Lemma \10.lV b): Consider any e > 0. With probability at least 1 — e~ 9<yn \ T(F,a) has 
fewer than en vertices of *-depth greater than L, where L comes from Lemma 15.21 Consider any 
vertex x of *-depth at most L. Consider a peeling chain for x of depth at most L and let W be the 
set of all hyperedges that contain at least one vertex of the peeling chain. 

If some hyperedges of W form a cycle, then there must be a cycle of length at most 2L within 
distance L of x. If no hyperedges of W form a cycle, then it is easy to see that we can change 
all of the variables in the peeling chain, one-at-a-time and still have a satisfying assignment for 
F. Indeed, this follows from a straightforward induction on L. Therefore, the variable x is not 
1-frozen. □ 
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10.1 Our hypergraph models 

Consider any 1-essential CSP, F, and any solution a. 

The vertices of T(F,a) are partitioned into two sets A + , A~ containing those variables which 
are assigned +1, —1 respectively under a. 

Definition 10.2. For each hyperedge e G T(F,o~): Let a be the number of non-essential vertices of 
e in A + and let b be the number of non-essential vertices of e in A - . The type of e is defined to be: 

• (l,a, b) if the essential vertex vertex of e is in A + ; 

• (— 1, a, b), if the essential vertex vertex of e is in A~. 

The type of a constraint of (F, a) with an essential vertex, is the type of the corresponding hyperedge 
in r(F, a). 

Now consider a nontrivial, feasible, symmetric, balance-dominated, 1-essential CSP-model T 
and choose a random (F,o~) from the planted model P(T,n, M). Recalling the Remark following 
Definition 13.21 we can selected the constraints of F independently. Given the partition A + , A - , and 
a type r, we let w(t) = w(t, A + , A - ) denote the probability that a selected constraint has type r, 
conditional on it having an essential vertex. (See Appendix [9] for further discussion.) Note that 
w(t) depends only on T, |A + |, |A~|. Note further that, conditional on a hyperedge e having type 
r, every choice of the vertices of e which is consistent with r is equally likely. Thus, when choosing 
T(F, a) we can choose the type of a hyperedge first and then its vertices. This leads us to: 
Model A: 

1. Partition the vertices into A + , A~ uniformly at random. 

2. For i = 1 to M, choose the ith hyperedge ej as follows: 

(a) Choose the type (s, a, b) of ej (where s £ {+1, —1}), where type r is chosen with proba- 
bility w(t). 

(b) Choose the essential vertex for uniformly from the appropriate set, A + or A - , accord- 
ing to s. 

(c) Choose a vertices uniformly from A + and b vertices uniformly from A~. These are the 
non-essential vertices of ej. 

In some cases, it will be useful to fix the essential vertex of every hyperedge, along with the 
assignment a, and then choose our planted hypergraph. In this case, for s G {— 1,+1}, we use 
w s (t) = w(t, A + , A - ) denote the probability that a selected constraint has type r, conditional on 
it having an essential vertex in A s . We can use the following model: 
The Essential Model: 

1 . We are given a partition the vertices into A + , A~ . 

2. For i = 1 to M, we are given the essential vertex of ej. We choose the rest of C{ as follows: 

(a) Choose the type (s, a, b) of e^, where s is already determined and type r is chosen with 
probability w s {t). 
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(b) Choose o vertices uniformly from A + and b vertices uniformly from A . These are the 
non-essential vertices of 

The essential model will be useful in analyzing the *-core of T(F, a). 

We let H\ denote the set of vertices v £ H* that are essential in exactly one hyperedge. We 
use Hf , H± to denote H± n A + , H\ n A~ , the vertices of H\ corresponding to variables assigned 
+1, —1 by a. The following lemma will be key in proving that most of H* is frozen: 

Lemma 10.3. // T is non-trivial, feasible, symmetric, and balance- dominated and if a > then 
there exists 7 = j(Y,a) > such that: for any g{n) = o(n), with probability at least 1 — e~ 9 ^ n \ 

(a) \V(H*) n A+|, \V(H*) n A- 1 = \V{H*)\(\ + o(l)); 

(b) \H+\,\Hi\<3^\ V (H*)\. 

The proof appears in Appendix II 1[ 
We close this section with: 

Proof of Theorem \1.3i Since r < r/(T), w.h.p. the *-core is empty. During the proof of 
Lemma 111.21 in Appendix [TT] we prove that for D sufficiently large, with probability at least 

I — e~ 9 ( n \ fewer than en vertices are within distance L of a vertex with degree greater than D. It 
follows that for all but at most en vertices of depth at most /, the size of their peeling chain is at 
most (kD) 1 = O(l). We can change any such variable by changing a subset of the entire peeling 
chain in one step. So, applying Lemma 15.21 we see that for all but 2en variables v, we can change 
v by changing at most (kD) 1 variables. 

We use Lemma 13.31 to show that this holds w.h.p. in the uniform model. Then by taking D 
arbitrarily large and e arbitarily small, we obtain the theorem. □ 

II Analysis of the *-core process 

Recall that T is a non-trivial, feasible, symmetric, balance-dominated, and 1-essential CSP-modcl, 
and that we draw (F, a) from the planted model. 

Let H denote the hypergraph T(F, a). H has M = an edges. We will analyze the *-core process 
on H (recall Section 2|) . We follow the analysis of [45] , being careful to obtain a failure probability 
of at most e~ g ^ for any g(n) = o(n); alternatively, we could have followed the analysis of |32j . 

Recall that A + , A~ denotes the sets of vertices corresponding to variables of sign +1, —1 in a. 
We can assume that |A + |, |A~| = ^n + o(n), as this occurs with probability 1 — e~ 9 ^ for any 
g(n) = o(n). 

Let H(0) = H and define H{i + 1) to be the hypergraph obtained by removing every vertex 
in H(i) that is not essential for any hyperedges, along with all hyperedges in which that vertex is 
non-essential. We call this operation a parallel round of the *-core process. We begin by analyzing 
H(i) for constant i, using Model A from section [10.11 

We let p\ ,pl denote the probability that a vertex v S A + ,A~ survives the i parallel rounds; 
i.e. Pr(u G H(ij). Initially Pq = Pq = 1; it will follow by induction that pf = p^~ + o(l). So we 
will recursively define pi and show that pf = p~ = pi + o(l). 

Consider any vertex v. Note that v £ H{i + 1) iff there is at least one hyperedge / in which v 
is the essential vertex and every non-essential vertex is in H{i). Lemma 19.11 implies the following 
key property: 
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Property 11.1. For every vertex v, the expected number of hyperedges in which v is essential is 
a + o(l). 

Consider any hyperedge e in which v is the essential vertex. Let the other vertices be ui, u^-i- 
By induction, Pr(uj G H(i)) = p% + o(l) for each 1 < j < k — 1. W.h.p. F is locally tree-like; 
in particular v does not lie within distance i of a cycle of length at most 2i. Prom this, it is 
straightforward to show that these k — 1 events are nearly independent, and so Pr(ui, € 
H(i)) = p\~ l + o(l). Furthermore, Property II 1.11 and the fact that w.h.p. v does not lie near a short 
cycle imply that the expected number of hyperedges in which v is essential and all non-essential 
vertices are in H (i) is Aj + o(n) where 

Xi = a.p\~ x . 

A similar straightforward expected number calculation shows that for any t > 0, the expected 
number of i-tuples of such hyperedges is A* + o(l); again, the key point is that if there are no 
nearby short cycles, then the hyperedges occur nearly independently. So the Method of Moments 
(see e.g. [31]) implies that the number of such hyperedges is distributed asymptotically as a Poisson. 
In particular, the probability that there is at least one is pi + \ + o(l) where 

Pi+x = l-e- Xi = l-e- a Pi~\ 
In other words pf +1 ,p^ + i = Pi+i + o(l), thus completing the induction. We define 

• Xf ,X~ is the number of vertices of A + , A~ in H(i); 

• Y ri + is the number of hyperedges in H(i) whose essential vertex is in A + , A~; 

• Af,A~ is the number of vertices of A + , A~ in H(i) that are not essential in any hyperedges 
offf(i); 

• Bf , B~ is the number of vertices of A + , A - that are essential in exactly one hyperedge of 
H(i). 

By the above calculations, Exp(A l + ), Exp(X~) = ^Pi n + o(n). Since every hyperedge has exactly 
one essential variable, those calculations yield Exp(Y~; + ), Exp(Yp) = \\n + o(n). Af,Aj count 
the vertices that are in H(i) but not in H(i + 1); so 'Ex.p(Af), Exp(A^) = — p i+ i)n + 
o{n). Since the number of edges in which v is essential is asymptotic to a Poisson with mean Aj, 
Exp(Sf),Exp(flr) = \\ie-^n + o(n). We will prove below that these variables are all highly 
concentrated. 

Lemma 11.2. For any fixed i > 0, and any constant e > 0, there exists n = n(e, a, i, T): 
\pin\ > en) < e~ vn , Pr(\X~ - \p{n\ > en) < e~ vn 
\\n\ > en) < e~^ n , Pr(\Yr - \\n\ > en) < e^ n ; 
\{pi - p i+ i)n\ > en) < e~^ n , Pr(\A~ - \{pi - p i+ i)n\ > en) < e~^ n ; 
\\ie-^n\ > en) < e^Pr^r _ ±\ ie - x *n\ > en) < e~^ n . 



(a) Pr(|X+ - 

(b) Pr(\Y+- 

(c) Pr(|A+ - 

(d) Pr(|S+- 
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We defer the proof to the end of this appendix. 

We let p = linij^oo pj , which exists since is positive and decreasing. So p must satisfy 
p = 1 — e~ ap . Setting A = lim^oo Aj = ap , we obtain: 

p = 1 - e~ A ; so A = a(l - e~~ X ) k ~ l ; so a 



(l-e-A)fc-r 
We will prove: 

Lemma 11.3. For any g(n) = o(n), with probability at least 1 — eT 9 ^ : 

(a) If a < otk then the *-core ofT(F,a) has o(n) vertices. 

(b) If a > otk then the *-core ofT(F,o~) has 

(i) \pn + o(n) vertices in each o/A + ,A~; 

(ii) \\n + o(n) hyperedges with essential vertices in each o/A + ,A~; 

(Hi) |Ae~ A n + o(n) vertices in each o/A + ,A~ that are essential in exactly one hyperedge. 

Recalling that = inf x >o ^_ e - x ^k-i ? we have that for a < ak,p = 0. For a > a(k) we define 
scjfc(a) to be the maximum x > such that a = ^ 1 _ e x x ^ k -i ■ It is straightforward to show that 

Xfc(a) < 1, A = Xfc(a) and p = 1 — e . Thus Lemma 111.31 implies Lemma 14.31 and Lemma ll0.3f a). 
Also, Lemmas 111.21 ITL31 imply Lemma 15.21 as follows: 

Proof of Lemma 1 5.2c For any e > we can choose I such that pi < p + e. The number of 
vertices outside of the *-core with *-depth greater than / is Xf + XJ minus the size of the *-core, 
and hence is less than en. This proves the lemma with L = I. □ 

We will require the following bound: 

Lemma 11.4. For any a > a& there exists 7 > such that Ae _A < (1 — r y)p/(k — 1). 

Proof. Let x\ be the value of x > that minimizes ^ 1 _ e x x ^ k _ 1 ■ It is straightforward to check that 
for a > ctk we have Xk(at) > x\. Differentiating, we see that 

(1 - e- Xl ) k ~ 1 — (k — l) Xl e- Xl (l - e~ Xl ) k - 2 = 0; so 1 - e~ Xl = (k - \)x x e~ x \ 

Clearly l ~^- x = \{e x — 1) = (1 + § + ...) is increasing with x. So for x > x\ we have l ~^- x > k — 1 
which yields the lemma since A = Xk(ct),p = 1 — e~ Xk ^ a \ □ 

Note that Lemmas 111.31 [TT741 imply Lemma 110.3( b), 

Proof of Lemma \ 11.31 We will choose a small constant £ > 0. Lemma lll.2l implies that we can 
choose I sufficiently large that, with probability at least 1 — e~ 9 ^: 

{\p-C)n < X+,XJ < (^p+C)n; Y+,Yf < (^A+C)n; A+,AJ < ±(n; B+,BJ < (±\e 

Recall that the order in which vertices are removed during the *-core process does not affect 
the outcome. So we can remove them as follows: First, we carry out / parallel rounds. Then we 
remove vertices that are not essential in any edges one-at-a-time in arbitrary order; eg. we can pick 
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one of the removable vertices uniformly at random, or we can choose the removable vertex with 
the lowest label. 

After the I parallel rounds, we expose the vertices that remain, W, the number of hyperedges 
that remain, Yj and for each remaining hyperedge / we expose its essential vertex, ess(/). The 
following observation allows us to analyze H(I) using the Essential Model (section HO.ip , 

Observation: Consider any two hypergraphs J7, $7' on the same subset of the vertices of H, 
with edge set {ei,...,e^} and {e'i, e' e }, such that: for each 1 < j < I, the hyperedges £j,e'j have 
the same type and the same essential vertex. Then 0, 0' are equally likely to be H(I). 

To see this, let R be any hypergraph such that applying / iterations of the parallel process 
to R yields VL. Form R' from R by replacing every edge of R that is in £1 by the corresponding 
edge from 17'. Then applying / iterations of the parallel process to R' will yield f2'. Furthermore, 
Pr(r(F, a) = R)= Pr(T(F,a)) = R' . 

Note that this observation allows us to model H(I) using the Essential Model. So we expose 
the vertices of H(I), and for each hyperedge e € H{I) we expose the essential vertex of e. We let 
H± denote the set of vertices that are essential in exactly one hyperedge; so = Bf + BJ < 
(Xe~ x + 2()n. 

From here, the analysis is nearly identical to that from the proof of Lemma ll2.111 

Our first step will be to expose the type of every hyperedge; recall that we choose these types 
independently and the probability that a hyperedge with essential vertex in A s has type r is w s (t). 
For each vertex x £ Hi, if the type of the hyperedge in which x is essential (s,a,b) then we say 
a(x) = a, b(x) = b. We set A = Y^xeHr a ( x ) an< ^ B = X^effi b(x). As in the proof of Lemma [12JTJ 
with probability at least 1 - e~ 9 ^ we have A,B = \Hi\(\(k - 1) + o(l)). 

As we remove vertices one-at-a-time from H(I), we let L denote the set of removable vertices 
that remain. So initially, \L\ = Af + Aj < Qn. At each step, we remove a vertex w from L. For 
each hyperedge / containing w, if the essential vertex ess(/) is in Hi then we add ess(/) to L. 

We carry out up to — n steps. If we do not reach the *-core before that time, then we must 

have added a total of at least ^-n — Qn vertices to L during those steps. 

To determine which vertices are added to L we expose the following information: For each 
remaining hyperedge /, we ask whether w is a non-essential vertex of /. If it is, then we delete / 
and place ess(/) into L if ess(/) € H%. If w is not in /, then we do not expose the non-essential 
vertices of /. 

Suppose w € A + . As in the proof of Lemma 112.111 it is easy to compute that the vertices of 
Hi that will be added to L are determined by at most Hi independent trials of total probability 
at most 

A (iAe- A + C)n 1 

^ T f^ < (^-C)n-fn <1 "2 7 ' 

for £ sufficiently small, by Lemma lll.4i Similarly, if w € A~ then we have at most Hi independent 
trials of total probability at most 1 — ^7. 

Summing over the first ^-n iterations, the total number of vertices added to L is upperbounded 
in distribution by the sum of ^-n x Hi independent trials, each with probability 0(n _1 ) and with 
total expectation ^n(l — ^7) = ^-n — 2Qn. Standard concentration results for binomial variables 
yield that the probability that they total more than ^-n — £n is at most e~ Sn for some 5 > 0. 

So for every C > 0, there exists 5 > such that with probability at least e~ Sn , we halt within 
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steps. If we halt within that many steps then the number of vertices of A + that are in the *-core is 
between Xf and Xf — -^-n and hence is within ^j-n of \pn. Since we can take £ arbitrarily small, 
this implies that for any g{n) = o(n), the number of such vertices is i^pn + o{n) with probability at 
least l-e'sW. The same argument applies to the other parameters, thus proving Lemma lll.31 □ 

It only remains to prove our concentration lemma: 

Proof of Lemma We will apply Azuma's Inequality |15j which implies (see eg. [14]) that 

for any random variable Q = Q(H) = 0(n), if changing the vertices of one of the an hyperedges 
in H can change Q by at most an additive constant, then Pr(|Q — Exp(Q)| > en) < e~®( n \ 

We start with the concentration of Xf. Note that whether v is counted in Xf is determined 
entirely by the subgraph induced by N l (v), the set of vertices within distance i of v. In an extreme 
case, changing a single hyperedge / can affect Xf by a lot, if / is within distance i of many vertices. 
So we fix a large constant D and define: 

D = the set of vertices that are within distance i of a vertex u of degree > D, 
X^(D) = the number of vertices of A + \ that are in H(i). 

Changing a single hyperedge can affect Xf (D) by at most 2k((k — 1)D) % = O(l). Indeed, if 
changing the vertices of / affects whether v 6 Xf then v is connected to one of the old or new 
vertices of / by a path of length at most i. If any vertex on a hyperedge of that path has degree 
greater than D then t)£fj) and so v will not count towards Xf(D). So each of the 2k old or new 
vertices of / can affect at most ((k — l)D l ) vertices v. Therefore, there exists n\ = rjx(D,e,k,i,T) 
such that 

Pr(\X+(D) - Exp(X+0D))| > \en) < e~^ n . 

A standard property of random graphs (and indeed an easy calculation) yields that by taking D 
sufficently large, we can make Expd^^l) an arbitrarily small multiple of n. (Roughly: the expected 
number of vertices u of degree greater than D drops exponentially in D while the expected number 
of vertices within distance i of each such u is linear in D, for fixed i.) So we choose D such that 
Exp(|#d|) < \en. 

Next we show that \^d\ is concentrated. A similar argument to that above shows that changing 
the vertices of a single hyperedge / can affect \^d\ by at most 2k((k — 1)D)' = O(l). Indeed, if 
changing / affects whether v £ then v is connected to one of the old or new vertices of / by a 
path of length at most i. If any vertex on the hyperedges of that path has degree greater than D 
then DEffl regardless of the choice of /. So each of the 2k old or new vertices of / can affect at 
most {(k — l)D r ) vertices v. Therefore, there exists 772 = ^(D, e, k, i, T) such that 

Pr(|*D - Exp(|tfr,|)| > \en) < e~ mn . 



Noting that Xf(D) < Xf < Xf(D) + \^d\ and applying linearity of expectation, we have 

Pr(|X+ - Exp(A+)| > en) < e"" 1 ™ + e"" 2 " < e"^, 

for any n < 771,772- The proof for the remaining parameters is nearly identical. □ 

We close this section by noting that by the same reasoning as for the Observation in the proof of 
Lemma [11.3l we can model the *-core of T(F, a) using the Essential Model. We do so in section [T2l 
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12 Frozen variables in the *-core 



Here, we prove Lemma 110.1( a). Recall that (F,a) is drawn from P(T,n,M = rn) where T is 
symmetric and 1-essential. 

H* is the *-core of T(F,a), so every edge of H* has exactly one essential vertex and every 
vertex is essential for at least one edge. Hi is the set of vertices that are essential in exactly one 
hyperedge of H*. We need to show that, with sufficiently high probability, all but o(n) vertices in 
H * are fin- frozen variables of (F, a) . 

Definition 12.1. For each vertex x 6 Hi, we use e(x) to denote the unique hyperedge of H* in 
which x is the essential vertex. 

Definition 12.2. A flippable set of H* is a set of vertices S C H* such that for every x € S and 
for every hyperedge f € H* in which x is essential, S contains another vertex of f . 

Note that, since every hyperedge of H* has exactly one essential variable, that other vertex is 
not essential for /. 

Given two boolean assignments o~, a' to the variables of F, we let aAa' denote the set of variables 
x for which a{x) ^ a'{x). 

Proposition 12.3. (a) If a' is any solution of F, then (aAa') n H* is a flippable set. 

(b) The union of any collection of flippable sets is a flippable set. 

Proof. For (a): if (aAa') n H* is not a flippable set, then there is some x G (aAo~') n H* and a 
hyperedge / € H* such that x is essential for / and aAa' contains no other vertices of /. Since 
H* C T(F,a), this means that x is essential for the constraint corresponding to / in (F,a), and 
that a' changes the value of x but not of any other variables in /. Therefore a' violates / and so 
a' is not a solution for F. 

For (b): this is immediate from the definition of a flippable set. □ 

To prove Lemma 110.1( a). we will show that there exists <f>'(n) = o(n), ( > such that, with 
sufficiently high probability, there are no flippable sets S in H* of size 4>'(n) < \S\ < Qn. We will 
apply a first moment bound. A direct approach does not work, because of a "jackpot phenomena": 
The existence of a flippable set S typically implies the existence of an exponential number of other 
flippable sets formed by adding to S variables x S H\ with the property that e(x) contains a 
member of S. To overcome this issue, we focus instead on sets with the following property. 

Definition 12.4. We say that a set A C H* \ Hi is weakly flippable if there exists P C H\ such 
that AL) P is flippable. A is ^-weakly flippable if there exists such a P with \P\ < ip. 

Given a flippable set S C H* , we consider a directed graph D(S) C D. The vertices of D(S) 
are the vertices of S; the edges of D(S) are defined as follows: 

• For each x G Sn Hi, we choose one other variable x' S e(x) that is in S, and we add the edge 
x — > x' to D(S). 

Note that, since S is flippable, there is at least one such x'. It is not important which one we choose, 
but to be specific we could, eg., choose the lowest indexed variable from amongst all variables of S 
(other than x) in e(x). 

Thus, every vertex in D(S) has outdegree either or 1. We define: 
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• As = S \ H\. Note that As is the set of vertices with outdegree in D(S). 

• Cs is the set of all vertices on directed cycles of D(S). Note that those directed cycles are 
disjoint since the maximum outdegree is 1. 

Since the outdegree of every vertex outside of As is one, there is a directed path from every 
x G S \ (A s U Cs) to A s U C s . 

Definition 12.5. A set of vertices xi, . . . ,x\ G H\ is cyclic if for some permutation ir G Si, x^u\ 
is in e(xj) for every 1 < i < a. 

Definition 12.6. Given a set A C H* , the closure of A, cl(A) is the set of all vertices x such 
that, either 

(a) x G A, or 

(b) x G Hi \ A and there is a sequence x = xq,xi, ...,xe where (i) xi G A and (ii) for all i < I: 
Xi G Hi \ A and Xi+i G e(xi). 

Proposition 12.7. If S is a flippable set, then: 

(a) As is weakly flippable. 

(b) Cs is cyclic. 

(c) SCcl(AsUCs). 

Proof, (a) follows from the definition of weakly flippable, with P = S n Hi. 

(b) follows from the definition of cyclic, where the directed cycles of D(S) form tt. 

For (c), if x G S \ (As U Cg), then x G Hi and the directed path from x to Ag U Cs in D(S) 
indicates that x satisfies condition (b) of Definition 112.61 □ 

Lemma 12.8. Suppose that for some <f>, </>', ip we have: 

(a) There is no ip-weakly flippable set A C H* \ Hi such that <p < \A\ < tp. 

(b) There is no cyclic set C such that <f> < \C\ < ip. 

(c) There is no set A such that \A\ < 2(f) and \cl(A) | > <fi' . 
Then there is no flippable set S C H* such that <f>' < \S\ < ifj. 

Proof. We apply Proposition 112.71 Let S be a flippable set with \S\ < ip. Then As is ^-weakly 
flippable. Thus, by (a), \As\ < 4>. Since Cs is cyclic and \Cs\ < \S\ < tp, (b) implies \Cs\ < <f>- 
Therefore, \As U Cs\ < 2cj), which by (c) implies that \S\ < |cl(^45UC5)| < <f>' '. The lemma 
follows. □ 

The following lemmas establish that the conditions of Lemma 112.81 hold with sufficiently high 
probability. 

Lemma 12.9. There exists £ = C(Y,a) > 0, and for any g(n) = o(n), there exists (j)(n) satisfying 
g(n) « <p(n) = o(n) such that: 

The probability that there is a (Qn)-weakly flippable set A of H* with <p(n) < \A\ < Qn is at most 
e -ff(«). 
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Lemma 12.10. There exists £ = £(Y,a) > 0, and for any g{n) = o(n), there exists 4>(n) satisfying 
g(n) « 4>{n) = o{n) such that: 

The probability that there is a cyclic set C in H* with <p(n) < \C\ < Qn is at most e~ 9<yn \ 

Lemma 12.11. There exists C, = £(Y,a) > 0, and for any 4>{n) = o(n), there exists 4>'{n) = o(n) 
such that: The probability that there is a set A C H* with \A\ < 20(n) and \cl(A) \ > <j)'(n) is at 
most e~^ ( n ) . 

These lemmas yield Lemma llO.l( a) as follows: 

Proof of LemmaWJXa): Note that we can take 4>'{n) > g(n). Lemmas TIM [LT9l 112.1UI 112.111 
imply that for all g(n) = o(n), there exists 4>'(n) = o(n) such that with probability at least 
1 - 3e~9(n) the * -core H* of T(F,a) has no flippable set of size between <p'(n) and Qn. So suppose 
that there is no such flippable set in H*. 

Let Si, St be all flippable sets in H* of size less than £n. Thus each \Si\ < <p'(n). Assume by 
induction that | U J i=1 Si\ < <p'(n). Then | U^ 1 Si\ < 2<f/(n) < (n. By Proposition ED^b), U^S* 
is a flippable set and hence it must have size less than <f)'(ri). Therefore | U* =1 Si\ < <j)'(n). 

Now consider any sequence of solutions a = gq , ai , . . . , an in which the assignment changes for 
at least one variable in H* \ (U* =1 5i). Let i be the lowest index so that (Ti(x) ^ a(x) for some 
x 6 H* \ (U| =1 5i). Therefore x G (cijAcr) n H* which, by Proposition 112. 3( a). is a flippable 
set. Since x £ \j\ =l Si, this implies |(<7jA<r) n H*\ > £n. By our choice of i, |(o"jAcrj_i) n H*\ > 
\(aiAo~)r\H*\ — |U* =1 5j| > (n—(p'(n). Therefore every variable in fl"*\(U* =1 5j) is (C n ~ 0'(n))-fi"ozen. 
This yields Lemma llO. 1( a) for any j3 < £ after rescaling g(n). □ 

We prove Lemmas 112.91 112. 10[ 112.111 in the next three subsections. In each case, we will study 
H* using the Essential Model. See the discussion at the end of Appendix [TT] explaining why it is 
valid to do so. 

12.1 Weakly-flippable sets: Proof of Lemma 112.91 

Suppose that A C H* \ Hi is a (£n)-weakly flippable set with <j>(n) < \A\ < (n. 

Set a := \A\ and note that there are at least 2a hyperedges of H* whose essential variables are 
in A, since A contains no variables of Hi. Let ex,..., e^ a denote exactly 2a such hyperedges; to be 
specific, the 2a with the lowest indices. Since A is (£n)-weakly flippable, for each 1 < j < 2a there 
exists a sequence of vertices Xj t o, xj t x, ...,Xjjj such that: 

(i) Xjfl € A is the essential vertex of ej ; 

(ii) Xj itj € A; 

(iii) Xj t x G ej, and if tj > 1 then for each 1 < i < tj — 1: Xj^ £ Hx and Xjj+x G e ( x j,i)- 

Note that possibly tj = 1 in which case ej contains a non-essential member of A. 

These sequences are not necessarily disjoint. However, since ej ^ eji for all j ^ j' , we can take 
initial portions of them so that the portions in Hi are disjoint. I.e., there exist lx, ■ ■ ■ , ha ^ with 
Y^j=i h — C n such that 

(i) the vertices Xji : 1 < j ' < 2a, 1 < i < h are distinct; 

(ii) for j = 1, . . . , 2a: a^.+i G A U {xj> }i : 1 < / < j, 1 < i < lj>}. 
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We will bound the expected number of such collections of sequences, when H* is chosen from the 
Essential Model. So we expose the vertices of H* , and for each hyperedge ex, e an we expose the 
essential vertex of e, . By Lemma 110.3( a) we can assume that \H*nA + \,\H*r\A+\ = ^\H*\ + o(n). 

Fix some < I < C,n. First we will choose lx,..-,ha — summing to I. The number of choices 

is (^a-l 1 )" 

Next, we choose A; note that this determines ex,..., &2a and their essential vertices a^o, ■ X2a,o- 
The number of choices is (j H a ') < (") . 

Next we choose the remaining vertices. To do so, we first determine their signs; i.e. which are 
in A + and which are in A~. So for each j, we choose a pattern 8j - a sequence of U + 2 terms from 
{+, — } indicating the signs of Xj^, ...,Xjj-+x. Note that, since Xj t o is already determined, the first 

sign of 9j is already known. Recall from Lemma [10.31 that we can assume l-f^l, \H±\ < \H*\ x izr- 

Thus, given 9j, the number of choices for Xj t \, ...,Xjj. is at most (\H*\ x j^f) j ■ 

Finally, we choose Xj^ j+ \ : 1 < j < 2a. These are not neccesarily distinct, and they are all 
members of A U {xj^ : 1 < j < 2a, 1 < i < lj}. So the number of choices is at most (a + £) 2a . 

Having selected these vertices, we bound the probability that all edges are as required. 

Consider selecting the type of a hyperedge whose essential vertex is in A + ; thus we are select 
type r = (l,a,b) with probability u> +1 (t). We let g = g(T) denote the expected value of a, and so 
the expected value of b is k — 1 — g. Because T is symmetric and \H* D A + | = \H* D A~|(l + o(l)), 
it follows that w +1 (l, a, b) = z« _1 (— 1, b, a) + o(l). So when we select the type of a hyperedge whose 
essential vertex is in A - , the expected value of b is g + o(l). 

Now we select the types and then the non-essential vertices for the hyperedges e\, ...,e2a and 
e(xj t i) : 1 < j < 2a, 1 < % < lj. Recall that we choose those vertices uniformly from A + n H* or 
A - n H* depending on what the type of the hyperedge tells us the sign of the vertex should be. For 
each j, we require that Xjx is a non-essential vertex of ej and that Xji+\ is a non-essential vertex 
of e(xj,i). By Lemma [T073T a) . |A+ n H*\, |A _ n H*\ = \H*\{\ + o(l)), and so for each hyperedge 
this event occurs with probability 29 ^j*^ if the essential and non-essential vertices have the same 

sign, and 2 ^~ 1 |^| + °^ 1 ^ otherwise. Note also that these events are independent. 

We let y{6j) denote the number of terms in 9j that are the same as the preceding term. So the 
probability that the required vertices are selected as non-essential vertices in each hyperedge is: 

2g + o(l) \ E ?= 1 ^ {2(k-l-g) + o(l) N E ?= 1 



\H*\ J V 1-^*1 

Putting this all together yields that the expected number of (Cn)-weakly flippable sets A, given 
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a, £, is at most: 




For each value of y, there are (^y 1 ) patterns 6j with y(0j) = y, since the first sign in 9j is 
already chosen. This implies 

naxy^* - 1 - = n e ( lj + X W - 1 - = (* - ^ 

i=l ^ j = l y=0 ^ ^ ' 

So (J7J) is at most 

for some constant C > 9e 3 (p^) 2 (see Lemma [43]). So the total expected number of A with 
4>(n) < \A\ < £n is at most: 

a=<f>(n) V 7 £>0 V 7 

To bound this, it is easy to see that (l + |) 4a (1 — !£f is maximized at i = 0(a) and hence is at 
most y 4a (l — ^) 4a < y 4a for some constant y = 1^(7). Since 1 — 7 < (1 — ^) 2 , this yields an upper 
bound of: 




By taking ( < 2( } Y i > this is less than Y^^Mn) ^ ° which is ^ ess than e 9 ^ n ^ for any 4>(n) » g(n). 



12.2 Cyclic sets: Proof of lemma [T2TI0] 

Note that the vertices of a cyclic set are partitioned into cycles by the permutation ir. We will fix 
a constant Z, to be named later. A small-cyclic set is a cyclic set in which each cycle has length 
at most Z. A large-cyclic set is a cyclic set in which each cycle has length greater than Z. 
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Lemma 12.12. For any g(n) = o(n) there exists 4>(n) such that with probability at least 1 — e~ 9 ^ : 

(a) H* has no small-cyclic sets of size at least ^4>(n). 

(b) H* has no large-cyclic sets of size at least ^(/>(n). 

This clearly proves Lemma 112, 1UI as any cyclic set of size at least 4>(n) contains either a small- 
cyclic set or a large-cyclic set of size at least ^4>(n). Again, we work in the Essential Model. 

Proof of (a): We say that a cycle in F(F, a) is a set of vertices x\, ...,xg such that Xi, Xi+i lie in 
a common hyperedge of F(F,o~) for each i (addition is mod £). For any Xi,xj, the probability that 
Xi, Xj share a hyperedge in F(F, a) is less than c/n, for some constant c = c(Y, a). 

If H* has a small-cyclic set of size at least ^4>(n), then the hypergraph F(F, c) must contain at 
least <f>(n)/(2Z) cycles of size at most Z, and so it must contain at least 4>(n) / '(2Z 2 ) cycles of size 
exactly z for some z < Z. Setting W := cp(n)/(2Z 2 ), the probability of this occurring for z is less 
than: 

n zw , c yw (c«)W ! 

Wl \n) W\ 2Z 

if 4>(n) » g(n). (Note that the dependency between the events that the zW pairs of vertices each 
share a hyperedge goes in the right direction for this bound to hold.) Summing over all z < Z 
proves (a). □ 

Proof of (b): We will bound the expected number of large cyclic sets of size a. 

A pattern 6 is a sequence of a terms from {+, — } indicating the signs of x\,...,x a . By 

Lemma 110.31 we can assume that < -|— f|i7*|. So for any pattern 9, the number of 

/I_ 7 \ a 

choices for x±, ...,x a is at most ( j^i\H*\ I . 

Given a pattern and a permutation tt, we let y(6, ir) denote the number of i such that Xi,x n u\ 
have the same sign. Recall g from the proof of Lemma fl 2.9} the same reasoning as in that proof says 
that, for any choice of greement with 9, the probability that x n ^ is a non-essential 

, . f w . . f 2g+o(l) \y( e ri ( 2(k-l-g)+o{l) \ a -y^^ 

vertex m e{Xi) tor every i is I T[pri J ( — J 

We let c(tt) denote the number of cycles in 7r; since we are considering large-cyclic sets, we only 
need to consider permutations tt with c(tt) < a/Z. For any tt, y, the number of choices of 9 with 
y(0,ir) = y is at most 2 c( - 7r )(^) < 2 a l z (y)- Indeed, there are choices of the values of i for which 
Xi, x^u\ have the same sign; given one such choice, the pattern is determined by fixing the sign of 
one vertex in each of the c(ir) cycles. Note that this is an upper bound; as for some tt, y, parity 
conditions will imply that there is no such 0. 

To bound the expected number of large-cyclic sets of size a, we sum over all ordered choices of 
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xi, ...,x a and all choices of tt, and then divide by a!, obtaining: 



1 V- ^-7 lir ,AV ( 2 9 + o{l)\ ^ (2{k-l-g) + o(l)^ a ~ y ^ 



a\ ^ \ k - 1 1 1 / ^ \ Iff*! /V Iff 



< iH)"s£E^(><*-i-sr» 



if Z is chosen large enough that (1 — j)2 1 ^ z < (1 — ^7). 

So the probability that there is a large-cyclic set of size at least ^4>(n) is at most 0(1)(1 — 

± 7 )§^ n ) < ± e -9(n) for any 0( n ) >> 5 ( n ) □ 



12.3 Closure: Proof of lemma [12JJJ 

We will choose 4>'(n) >> (p(n). Again, we work in the Essential Model. 

Consider a set A of size at most 20(n) = o{<f)'{n)). We can find cl {A) using the following search: 

1. Initialize C = 0,L = A. 

2. While L ^ 

(a) Choose u £ L. 

(b) For every w £ H\\(C U L) such that u G e(tu), add w to L. 

(c) Remove tt from L and add u to C. 

When this procedure halts, C = cl(A). Note that |cl (A) | is the number of times that we 
execute the loop in Step 2. If |cl (^4) | > 4>'(n) then during the first <j)'{n) iterations we never reach 
L = and so we must add a total of more than <p'(n) — \A\ = 4>'(n)(l — o(l)) > </>'(n)(l — ^7) 
vertices to L in Step 2(b), where 7 comes from Lemma [10.31 We will bound the probability of that 
occuring. 

We analyze ff* using the Essential Model. So we expose the vertices of ff*, and for each 
hyperedge ei,...,e an we expose the essential vertex of e^. By Lemma 110,3 ( a) we can assume that 
|ff*nA+|,|ff*nA+| = l\H*\ + o(n). 

Our first step will be to expose the type of every hyperedge; recall that we choose these types 
independently and the probability that a hyperedge with essential vertex in A s has type r is w s (r). 
For each vertex x G ffi, if the type of x is chosen to be (s, a, b) then we say a(x) = a, b(x) = b. We 

set A = £ xeJTl a ( x ) and B = Exetfi K x )- 

Because T is symmetric and [ff* n A + | = [ff* n A~|(l + o(l)), it follows that w +l (l,a,b) = 
1, b, a) + o(l). This implies that for x G A + ,y G A - , Exp(a(x)) = Exp(6(y)) + o(l) and 
Exp(6(x)) = Exp(a(y)) + o(l), and it follows that Exp(^), Exp(ff) = |ffi|(|0 - 1) + o(l)). The 
number of hyperedges of each type is a binomial variable and so is easily seen to be highly enough 
concentrated that with probability at least 1 — e~ 9 ^ we have A,B = \Hi\(^(k — 1) + o(l)). 
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Now we analyze our search. We can choose u arbitrarily in Step 2(a); to be specific, we choose 
the u E L with the lowest index. To carry out Step 2(b): for every x G Hi \ (C U L), we expose 
whether u G e(x); if u £ e(x) then we do not expose the non-essential vertices of e(x). 

Suppose u G A + . To test whether u G e(x), we ask whether u is one of the a(x) non-essential 
variables from A + . Initially, the probability is ' as P rocecm re progresses, this increases 

as we have exposed that the members of C are not in e(x). But since \C\ < <p'(n) it never exceeds 
\H*n < A+\-<t>(n) • ^ e as ^ * n ^ s f° r every x G Hi \ (C U L) resulting in at most Hi independent trials of 
total probability at most 

A \Hi\{±(k-l) + o{\)) 

< 1 - 7, 



\H* n A+| - <t>{n) " \\H*\(l + o(l)) 

by Lemma HU.3f b). Similarly, if u G A~ then we have at most Hi independent trials of total 
probability at most 1 — 7. 

Summing over the first (f>'(n) iterations, the total number of vertices added to L is upperbounded 
in distribution by the sum of <p'(n)Hi independent trials, each with probability 0(n -1 ) and with 
total expectation 4>'{n){l — 7). Standard concentration results for binomial variables yield that the 
probability that they total more than <j)'(n)(l — ^7) is at most e~ c< ^'^ for some c = c{g,k,j). 

So the expected number of sets A of size at most </>(n) for which |cl (A) \ > 4>'(n) is at most 



£ 

a=l 



a J \<P( n ) J \ <K n ) / 

By choosing cj){n)\og{n/<p{n)) « 4>'{n) = o(n), this probability is less than e~^ n \ as required. □ 



34 



